• 



33 near-duplicate of a previously processed 

34 document, then associating the document 

35 corresponding to the candidate search result with 



36 a cluster identifier associated with the 

3 7 previously processed document. 




Please cancel claims 14, 15, 30-33 and 39 without 
prejudice to, or disclaimer of, the subject matter recited 
therein. 



In accordance with 37 C.F.R. § 1 . 121 (c) (1) (ii) , 
separate sheets with the rewritten claim marked-up to show 
the changes made to the previous version of the claim, is 
filed herewith. 

REMARKS 

In view of the foregoing amendments and the following 
remarks, the applicants respectfully submit that the 
pending claims are not anticipated under 35 U.S.C. § 102. 
Accordingly, it is believed that this application is in 
condition for allowance. If, however, the Examiner 
believes that there are any unresolved issues, or believes 
that some or all of the claims are not in condition for 
allowance, the applicants respectfully request that the 
Examiner contact the xindersigned to schedule a telephone 
Examiner Interview before any further actions on the 
merits . 

The applicants will now address each of the issues 
raised in the outstanding Office Action. 





Objections 



Claim 16 is objected to as being dependent upon a 
rejected base claim, but was found to include allowable 
subject matter. Claim 16 has been rewritten in independent 
form to include the elements of base claim 14 and 
intervening claim 15. Accordingly, this objection to claim 
16 should be withdrawn. 

Rejections xinder 35 U,S,C, § 102 

Claims 14, 15, 3 0-33 and 3 9 stand rejected under 3 5 
U.S.C. § 102(e) as being anticipated by U.S. Patent No. 
6,119,124 ("the Broder patent"). The applicants 
respectfully request that the Examiner reconsider and 
withdraw this ground of rejection in view of the following. 
Since these claims have been canceled, this ground of 
rejection is now moot. 

Conclusion 

In view of the foregoing amendments and remarks, the 
applicants respectfully submit that the pending claims are 
in condition for allowance. Accordingly, the applicants 
request that the Examiner pass this application to issue. 



Respectfully submitted. 



June 12, 2003 
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CERTIFICATE OF MAILING under 37 C,F,R. 1,8 (a) 



I hereby certify that this correspondence is 
being deposited on June 12, 2003 with the United States 
Postal Service as first class mail, with sufficient 
postage, in an envelope addressed to the Commissioner for 
Patents, P.p. /Sb^ 1450, Alexandria, VA 22313-1450. 

Reg. No. 36,242 
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1 

SEPARATE SHEETS WITH MARKED UP VERSION OF CLAIM PER 37 
C.F.R, § 1,121 (c) (1) (iii) 

Claim 16 has been amended as follows: 

1 16. (AMENDED) A method for filtering search results to 

2 remove near-duplicates, the method comprising: 

3 a) for each of a predetermined number of candidate 

4 search results, determining whether the candidate 

5 search result is a near-duplicate of another candidate 

6 search result; and 

7 b) if it is determined that the candidate search 

8 result is a near-duplicate of another candidate search 

9 result, then rejecting the candidate search result 

10 wherein the act of determining whether a 

11 candidate search result is a near-duplicate of another 

12 candidate search result includes 

13 i) comparing a cluster identifier of the 

14 candidate search result with that of the other 

15 candidate search result, and 

16 ii) if the cluster identifiers of the two 

17 candidate search results match, then concluding 

18 that the two candidate search results are 

19 near-duplicates, and 

20 [The method of claim 15] wherein cluster identifiers of the 

21 candidate search results are assigned by: 

22 i) determining whether or not a document 

23 corresponding to the candidate search result is a 

24 near-duplicate of any of previously processed 

25 documents, ^ 

26 ii) if it is determined that the document V) 

27 corresponding to the candidate search result is > 



1 





28 
29 
30 
31 
32 
33 
34 
35 
36 
37 



not a near-duplicate of any previously processed 
document, then associating the document with a 
unique cluster identifier, and 
iii) if it is determined that the document 
corresponding to the candidate search result is a 
near -duplicate of a previously processed 
document, then associating the document 
corresponding to the candidate search result with 
a cluster identifier associated with the 
previously processed document. 
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